Partial Least Squares with Structured Output for Modelling the Metabolomics Data Obtained from Complex Experimental Designs: A Study into the Y-Block Coding

نویسندگان

  • Yun Xu
  • Howbeer Muhamadali
  • Ali Sayqal
  • Neil Dixon
  • Royston Goodacre
چکیده

Partial least squares (PLS) is one of the most commonly used supervised modelling approaches for analysing multivariate metabolomics data. PLS is typically employed as either a regression model (PLS-R) or a classification model (PLS-DA). However, in metabolomics studies it is common to investigate multiple, potentially interacting, factors simultaneously following a specific experimental design. Such data often cannot be considered as a "pure" regression or a classification problem. Nevertheless, these data have often still been treated as a regression or classification problem and this could lead to ambiguous results. In this study, we investigated the feasibility of designing a hybrid target matrix Y that better reflects the experimental design than simple regression or binary class membership coding commonly used in PLS modelling. The new design of Y coding was based on the same principle used by structural modelling in machine learning techniques. Two real metabolomics datasets were used as examples to illustrate how the new Y coding can improve the interpretability of the PLS model compared to classic regression/classification coding.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Partial least squares- least squares- support vector machine modeling of ATR-IR as a spectrophotometric method for detection and determination of iron in pharmaceutical formulations

Iron is an essential element used as supplement in different dosage-forms. Different time and expenditure-consuming methods introduced for detection and determination of elemental ions such as atomic absorption. In this research, two different and routine methods containing ATR-IR and atomic absorption were applied to define the amount of iron in 198 samples containing different concentrations ...

متن کامل

Partial least squares- least squares- support vector machine modeling of ATR-IR as a spectrophotometric method for detection and determination of iron in pharmaceutical formulations

Iron is an essential element used as supplement in different dosage-forms. Different time and expenditure-consuming methods introduced for detection and determination of elemental ions such as atomic absorption. In this research, two different and routine methods containing ATR-IR and atomic absorption were applied to define the amount of iron in 198 samples containing different concentrations ...

متن کامل

Component-based Predictive and Exploratory Path Modeling and Multi-block Data Analysis

This discussion paper will focus on the predictive modeling of relationships between latent variables in a multi-block data framework. We will refer to component-based methods such as Partial Least Squares Path Modelling, Generalized Structured Component Analysis as well as to some of their recent variants and other alternatives. We will compare these approaches by paying particular attention t...

متن کامل

Identification and Structural Pattern of Top Managers' Psychological Traits in the Oil Industry

Nowadays, psychological traits of managers are among the factors that influence the success of organizations and are a key element of human resource empowerment. The current research aimed to identify psychological traits of Top managers in Oil fields of West Azerbaijan province which has been studied by the combined method (quantitative - qualitative). In the qualitative section, postmodern pa...

متن کامل

Removal of Brilliant Green and Crystal violet from Mono- and Bi-component Aqueous Solutions Using NaOH-modified Walnut Shell

In the present work, the simultaneous determination of Brilliant green (BG) and Crystal violet (CV) dyes with overlapped absorption spectra in binary mixture solution, was carreid out using the partial least squares (PLS) and direct ortogonal signal correction-partial least squares (DOSC-PLS) methods. The results obtained indicate that by applying DOSC on the calibration and prediction data for...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2016